Spatio-Temporal Action Localization For Human Action Recognition in Large Dataset

نویسندگان

Sameh MEGRHI

Marwa JMAL

Azeddine BEGHDADI

Wided Mseddi

چکیده

Human action recognition has drawn much attention in the field of video analysis. In this paper, we develop a human action detection and recognition process based on the tracking of Interest Points (IP) trajectory. A pre-processing step that performs spatio-temporal action detection is proposed. This step uses optical flow along with dense speed-up-robust-features (SURF) in order to detect and track moving humans in moving field of views. The video description step is based on a fusion process that combines displacement and spatio temporal descriptors. Experiments are carried out on the big data-set UCF-101. Experimental results reveal that the proposed techniques achieve better performances compared to many existing state-of-the-art action recognition approaches.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions

This paper introduces a video dataset of spatiotemporally localized Atomic Visual Actions (AVA). The AVA dataset densely annotates 80 atomic visual actions in 64k movie clips with actions localized in space and time, resulting in 197k action labels with multiple labels per human occurring frequently. The main differences with existing video datasets are: (1) the definition of atomic visual acti...

متن کامل

Robust and efficient models for action recognition and localization. (Modèles robustes et efficaces pour la reconnaissance d'action et leur localisation)

This thesis addresses the problem of action recognition, i.e ., how to determine the type of action that is happening in a video and its temporal localization. First, we consider the problem of video representation—how to encode videos in a robust way, such that the representation is suitable for a wide variety of action classes, tasks and video types. We present an extensive evaluation study t...

متن کامل

Genetic Programming-Evolved Spatio-Temporal Descriptor for Human Action Recognition

The potential value of human action recognition has led to it becoming one of the most active research subjects in computer vision. In this paper, we propose a novel method to automatically generate low-level spatio-temporal descriptors showing good performance, for high-level human-action recognition tasks. We address this as an optimization problem using genetic programming (GP), an evolution...

متن کامل

Human activity recognition in videos using a single example

a r t i c l e i n f o Bag of video words Hierarchical codebook Spatio-temporal contextual information Probabilistic modeling Context Ensemble of volumes This paper presents a novel approach for action recognition, localization and video matching based on a hierarchical codebook model of local spatio-temporal video volumes. Given a single example of an activity as a query video, the proposed met...

متن کامل

Action is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization

We propose a weakly-supervised structured learning approach for recognition and spatio-temporal localization of actions in video. As part of the proposed approach, we develop a generalization of the Max-Path search algorithm which allows us to efficiently search over a structured space of multiple spatio-temporal paths while also incorporating context information into the model. Instead of usin...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2015

Spatio-Temporal Action Localization For Human Action Recognition in Large Dataset

نویسندگان

چکیده

منابع مشابه

AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions

Robust and efficient models for action recognition and localization. (Modèles robustes et efficaces pour la reconnaissance d'action et leur localisation)

Genetic Programming-Evolved Spatio-Temporal Descriptor for Human Action Recognition

Human activity recognition in videos using a single example

Action is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization

عنوان ژورنال:

اشتراک گذاری